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Abstract — This  paper  describes  the  in-field  operation  of  two 
interacting  autonomous  marine  vehicles  to  demonstrate  the 
suitability  of  Interval  Programming  (IvP),  a  novel  mathematical 
model  for  multiple-objective  optimization.  Broadly  speaking, 
IvP  coordinates  competing  control  needs  such  as  primary  task 
execution  that  depends  on  a  sufficient  position  estimate,  and 
vehicle  maneuvers  that  will  improve  that  position  estimate.  In  this 
work,  vehicles  cooperate  to  improve  their  position  estimates  using 
a  sequence  of  vehicle-to-vehicle  range  estimates  from  acoustic 
modems.  Coordinating  primary  task  execution  and  sensor  quality 
maintenance  is  a  ubiquitous  problem,  especially  in  underwater 
marine  vehicles.  This  work  represents  the  first  use  of  multi¬ 
objective  optimization  in  a  behavior-based  architecture  to  address 
this  problem. 

I.  Introduction 

This  work  posits  a  solution  to  the  problem  of  controlling 
an  autonomous  marine  vehicle  where  control  requirements  are 
conflicted  between  primary  task  execution  and  maintaining  a 
sufficiently  accurate  position  estimate  to  facilitate  primary  task 
execution. 

We  are  motivated  by  the  following  scenario:  two  heteroge¬ 
nous  vehicles  are  in  operation,  the  first  has  a  “survey  at  depth” 
role  while  the  second  shuttles  location  information  from  the 


Fig.  1.  Two  heterogeneous  unmanned  marine  vehicles  are  in  operation. 
The  first  has  a  “survey  at  depth  ”  role  while  the  second  shuttles  location 
information  from  the  surface  (GPS  fixes)  to  the  survey  vehicle  below  and 
transports  salient  survey  information  from  the  first  vehicle  to  the  surface. 
Control  of  the  “survey”  vehicle  is  balanced  between  the  need  to  survey,  and 
the  need  to  update  the  quality  of  its  position  estimate.  Position  updates  come 
in  the  form  of  a  sequence  of  range-only  communications  via  acoustic  modem. 

surface  (GPS  fixes)  to  the  survey  vehicle  below  and  transports 


salient  survey  information  back  to  the  surface. 

Data  communications  between  the  vehicles  is  by  way  of  an 
acoustic  modem  which  can  also  provide  both  one  way  and 
two  way  time  of  flight  measurements  between  the  vehicles.  A 
single  instance  of  the  latter  —  a  “ping” —  provides  range-only 
information  constraining  the  relative  locations  of  the  vehicles 
to  lie  on  an  approximately  spherical  manifold.  Initiating  a 
sequence  of  pings  from  a  well  chosen  sequence  of  locations 
(for  example  non-colinear  when  one  vehicle  is  static)  leads  to  a 
unique  solution  for  the  relative  cartesian  positions  between  ve¬ 
hicles.  Control  of  the  survey  vehicle  is  composed  of  primarily 
two  vehicle  behaviors  in  a  behavior-based  control  architec¬ 
ture.  We  utilize  multi- objective  optimization  via  the  interval 
programming  (IvP)  model  [1]  for  coordinating  behaviors  and 
discuss  why  this  method  of  behavior  coordination  overcomes 
a  well-known  drawback  of  behavior-based  systems. 

The  problem  of  controlling  a  robot  or  autonomous  vehicle 
in  a  manner  that  effectively  balances  task  execution  and 
maintaining  accurate  sensor-derived  beliefs  of  its  world  is 
a  ubiquitous  problem  not  exclusive  to  marine  vehicles,  but 
is  particularly  prevalent  in  underwater  vehicles.  This  work 
represents  the  first  use  of  IvP  in  a  behavior-based  architecture 
to  effectively  address  this  problem.  We  present  experimental 
validation  of  the  architecture  using  autonomous  kayaks  and 
acoustic  modems. 

II.  Background 
A.  Behavior-Based  Control 

In  behavior-based  systems,  robot  or  vehicle  control  is  the 
result  of  set  of  independent,  specialized  modules  working  to¬ 
gether  to  choose  appropriate  vehicle  actions.  It  can  be  viewed 
as  an  alternative  to  the  traditional  sense-plan-act  control  loop 
as  shown  in  Fig.  2,  where  decision-making  and  planning 
are  performed  on  a  single  world  model  that  is  built  up  and 
maintained  over  time. 

Commonly  cited  virtues  of  behavior-based  systems  include: 
the  ease  of  development  of  the  independent  modules,  the  lack 
of  a  single  complex  world  model,  and  the  potential  for  a 
highly  reactive  vehicle  with  certain  behaviors  triggered  by  the 
appropriate  events  in  a  dynamic  environment. 

The  origin  of  such  systems  is  commonly  attributed  to 
Brooks’  “subsumption  architecture”  in  [2].  Since  then,  it  has 
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Fig.  2.  Behavior-based  control  differs  from  conventional  control  by  com¬ 
posing  overall  vehicle  behavior  into  distict  modules  that  are  developed  and 
operate  largely  in  isolation,  and  coordinated  through  an  action  selection 
mechanism.  In  this  case,  action  selection  is  in  the  form  of  a  new  multi¬ 
objective  optimization  technique  to  overcome  known  difficulties  associated 
with  behavior-based  control. 


been  used  in  a  large  variety  of  applications  including:  indoor 
robots,  e.g.,  [3],  [4],  [5],  [6],  [7],  [8],  [9],  [10],  [11],  land 
vehicles,  e.g.,  [12],  planetary  rovers,  e.g.,  [13],  [14],  [15],  and 
marine  vehicles,  e.g.,  [16],  [17]  [18],  [19],  [20],  [21],  [22]. 

Action  selection,  as  indicated  in  Fig.  1,  is  the  process  of 
choosing  a  single  action  for  execution,  given  the  outputs  of 
the  behaviors.  The  “action  space”  is  the  set  of  all  possible 
distinct  actions.  For  example,  all  combinations  of  rotational 
and  angular  velocity  for  a  robot,  or  all  speed,  heading  and 
depth  combinations  for  a  marine  vehicle. 

B.  Known  Difficulties  in  Behavior-Based  Control 

The  primary  difficulty  often  associated  with  behavior-based 
control  concerns  action  selection  -  namely  how  to  ensure  the 
chosen  action  really  is  in  the  best  overall  interest  of  the  robot 
or  vehicle.  An  action  generally  is  a  vector  of  values,  one  for 
each  actuator  being  controlled.  For  example,  the  rotational  and 
angular  velocity  for  a  land  robot,  or  heading,  speed  and  depth 
for  a  marine  robot. 

Generally  there  are  two  techniques  used  in  practice.  The 
simplest  method  is  to  pick  (at  every  iteration  of  the  control 
loop)  a  single  behavior  to  have  exclusive  control  of  the  vehicle. 
Some  approaches,  like  [18],  [2],  [23]  assign  a  set  of  fixed 
priorities  to  behaviors,  and  conditions  for  their  activation.  The 
priorities  do  not  change  dynamically.  In  other  implementa¬ 
tions,  like  [20],  priorities  may  be  determined  dynamically. 
Although  appealing  due  to  its  simplicity,  it  is  problematic  in 
applications  where  the  outright  ignoring  of  the  “secondary” 
behaviors  leads  to  gross  vehicle  inefficiency,  as  is  the  situation 
with  task  described  in  this  work. 

The  other  common  form  of  action  selection,  known  variably 
as  “action  averaging”,  “vector  summation”  etc.,  takes  the 
output  of  each  behavior  in  the  form  of  a  vector  and  uses  the 
average  numerical  value  as  the  action  sent  to  the  vehicle’s 
actuators.  Summation  is  typically  weighted  to  reflect  behavior 
priority.  This  method  has  been  used  effectively  in  a  number 
of  applications,  [3],  [24],  [25],  [19],  [26]. 

When  the  preferred  actions  of  two  distinct  behaviors  dis¬ 
agree,  this  approach  rests  on  the  idea  that  the  alternative  ac¬ 
tions  degrade  in  effectiveness  in  a  manner  depicted  in  Fig. 3.  In 
such  a  case,  the  action,  or  actuator  setting,  in  between  the  two 
individually  preferred  actions  may  indeed  be  the  most  effective 


Fig.  3.  In  action-averaging,  each  behavior  outputs  a  single  best  action.  The 
best  action  presumably  is  the  most  effective  among  alternative  actions  for  that 
particular  behavior.  The  effectiveness  levels  of  alternative  actions  are  rendered 
here  only  for  illustration  and  do  not  participate  in  the  action  averaging  process. 
When  two  behaviors  are  non-mutually  exclusive  and  share  common  action 
choices  with  high  levels  of  effectiveness,  as  shown  here,  then  action  averaging 
typically  reflects  an  appropriate  compromise  between  behaviors. 

action  overall.  However,  action  averaging  is  problematic  in 
cases  when  alternative  actions  degrade  in  effectiveness  in  a 
manner  depicted  in  Fig.  4,  where  the  numerical  average  does 
not  represent  an  effective  compromise  between  two  behaviors 
that  are,  in  effect,  mutually  exclusive. 


Fig.  4.  The  average  of  the  best  action  produced  by  two  behaviors  may 
have  poor  value  for  both  behaviors.  The  chooser  of  the  action  is  oblivious 
to  the  error  since  the  behaviors  output  a  single  preferred  action  and  not 
communicated  the  underlying  effectiveness  of  their  alternatives,  rendered  here 
only  for  illustration.  In  this  case,  the  interests  being  pursued  by  the  two 
behaviors  are  mutually  exclusive,  and  the  “compromise”  is  detrimental  to 
both. 

Even  in  cases  where  two  behaviors  are  not  mutually  ex¬ 
clusive,  where  there  do  indeed  exist  actions  that  represent  a 
good  compromise,  as  shown  in  Fig.  5,  action  averaging  can 
not  guarantee  the  selection  of  the  right  action. 

In  this  work,  such  situations  are  typical  between  vehicle 
behaviors,  and  a  new  method  of  multi-objective  optimization 
is  applied  to  action  selection  to  overcome  this  problem. 

III.  The  “IvP”  Architecture 

Here  we  describe  the  use  of  multi-objective  optimization 
with  interval  programming  and  the  two  primary  behaviors  used 
in  this  experiment  comprising  task  execution  and  maneuvers 
to  update  vehicle  position  estimate. 

A.  Behavior-Based  Control  with  Interval  Programming 

By  using  multi-objective  optimization  in  action  selection, 
behaviors  produce  an  objective  function  rather  than  a  single 
preferred  action  (  [7],  [12]).  In  the  examples  in  Figs.  3  through 
5,  the  objective  functions  are  what  distinguish  opportunities  for 
compromise.  Note  the  pair  of  preferred  actions  in  Figs.  3  and 


Fig.  5.  A  multi-modal  behavior  is  one  where  alternative  actions  exist  that 
have  high  utility  but  actions  “in  between”  have  lower  utility.  For  example  a 
behavior  where  turning  hard  left  or  right  is  useful,  but  going  straight  is  bad. 
In  such  situations,  if  only  the  single  best  action  from  each  behavior  is  output, 
opportunities  for  effective  compromise  between  two  non-mutually  exlusive 
behaviors  is  missed  by  simply  averaging  the  two  elite  actions. 


4  are  virtually  the  same  despite  the  differences  in  utility  of 
secondary  alternatives. 

There  are  two  practical  challenges  in  handling  objective 
functions:  (1)  the  method  must  be  fast  enough  to  accommodate 
the  vehicle  control  loop,  typically  l-20Hz.  On  each  iteration 
new  functions  are  created  and  a  new  problem  solved  (speed 
is  a  primary  advantage  of  action  averaging).  (2)  the  method 
cannot  be  overly  restrictive  on  objective  function  form.  Non- 
convex,  multi-modal  functions  are  quite  common. 

The  IvP  model  specifies  (1)  a  scheme  for  representing 
functions  of  unlimited  form  and  (2)  a  set  of  algorithms  for 
finding  the  globally  optimal  solution.  All  functions  are  piece- 
wise  linearly  defined,  thus  they  are  typically  an  approximation 
of  a  behavior’s  true  underlying  utility  function.  Search  is  over 
the  weighted  sum  of  individual  functions  and  uses  branch 
and  bound  to  search  through  the  combination  space  of  pieces 
rather  than  the  decision  space  of  actions.  The  only  error 
introduced  is  in  the  discrepancy  between  a  behavior’s  true 
underlying  utility  function  and  the  piecewise  approximation 
produced  to  the  solver.  This  error  is  preferable  compared  with 
the  error  of  restricting  all  behaviors  to  a  quadratic  function 
for  example.  Furthermore,  the  search  is  much  faster  than  brute 
force  evaluation  of  the  decision  space,  as  done  in  [12].  The 
decision  regarding  function  accuracy  is  a  local  decision  to 
the  behavior  designer,  who  typically  has  insight  into  what  is 
sufficient.  The  solver  guarantees  a  globally  optimal  solution 
and  this  work  validates  that  such  search  is  feasible  in  a  vehicle 
control  loop  of  4Hz  on  a  600MHz  computer. 

To  enhance  search  speed,  the  initial  decision  provided  to 
the  branch  and  bound  algorithm  is  the  output  of  the  previous 
cycle,  since  typically  what  was  a  good  thing  to  do  a  fraction 
of  a  second  prior,  is  not  a  bad  thing  at  the  present.  (In  fact, 
when  something  does  change  dramatically  in  the  world,  such 
as  hitting  a  way-point,  the  solve  time  has  been  observed  to 
be  roughly  50%  longer,  but  still  comfortably  under  practical 
constraints). 

B.  Behavior  for  Executing  a  Survey  Task 

The  survey  behavior  drives  the  vehicle  to  a  sequence  of  way- 
points  at  a  preferred  speed.  The  produced  objective  function 


ranks  candidate  maneuvers  with  a  utility  that  degrades  lin¬ 
early  w.r.t.  the  candidate  maneuver’s  delta  from  the  preferred 
heading  and  speed,  as  shown  in  Fig.  6.  When  this  behavior  is 


Fig.  6.  The  survey  behavior  produces  an  objective  function  with  peak  utilities 
at  the  preferred  heading  and  speed  that  would  result  in  a  traversal  to  the  next 
area  of  interest  in  the  survey.  The  utilities  degrade  linearly  with  the  delta  of  a 
candidate  decision.  The  utility  function  is  parameterizable  at  mission  planning 
time.  The  function  is  produced  from  scratch  at  each  iteration  of  the  control 
loop  to  reflect  the  new  relative  position  of  the  vehicle  to  the  area  of  interest 
in  the  survey. 

run  in  isolation,  it  results  in  an  un-conflicted  set  of  maneuver 
legs  comprising  a  survey  as  shown  in  Fig.  7.  This  particular 
survey  mission  pattern  also  represents  the  survey  executed  in 
our  experimental  results  (alongside  another  behavior)  shown 
in  Fig.  8. 


Fig.  7.  The  survey  behavior,  if  run  in  isolation,  will  execute  a  series  of 
legs  governed  by  a  set  of  way-points.  The  prevailing  objective  function  ranks 
candidate  maneuvers  based  on  the  next  way-point.  The  objective  function  is 
shown  here  defined  over  candidate  heading  and  speed  where  the  darker  points 
represent  more  preferred  decisions. 


C.  Behavior  for  Resolving  Position  Uncertainty 

We  now  describe  the  design  and  implementation  of  a 
behavior  that  ensures  spatial  diversity  of  ping-locations  (the 
locations  from  which  acoustic  range  information  is  requested 
via  the  acoustic  modems).  The  goal  is  to  enable  the  survey 
vehicle  to  gather  enough  information  to  locate  (from  a  least 
squares  adjustment  or  otherwise  on  a  bundle  of  range  mea¬ 
surements)  the  nav-aid  vehicle  which  may  or  may  not  be 
moving.  Using  the  entirety  of  the  recorded  ping- set  is  not  a 
reasonable  option  given  the  latter  possibility.  This  motivates 
making  decisions  in  the  light  of  a  previous/local  ping  history. 
We  label  the  set  of  the  last  n  locations  (out  of  N  total  pings) 
at  which  range  information  was  recorded  between  the  two 
vehicles  =  [Xj^X^-i  •  •  •  Xn-u]t .  We  are  now  faced 
with  deciding  where  the  next  ping  should  occur  from.  This  is 
a  function  of  the  shape  of  the  locations  in  Dn  both  in  terms 
of  their  collective  spatial  extent  and  diversity.  Consider,  for 
example,  the  case  where  the  points  in  Dn  form  a  line.  It  may 
be  that  this  line  is  generally  broadside  to  the  nav-aid  vehicle 
and  naturally  yields  a  tight  solution.  However  a  trajectory 
broadly  co-linear  with  the  nav-aid  vehicle  will  lead  to  a  poorly 
constrained  solution.  We  adopt  a  motion  governing  behavior 
that  seeks  to  produce  spatially  diverse  set  of  measurement 
origins.  The  idea  is  to  examine  the  principal  directions  of  a 
tentative  point  set,Dn/,  formed  by  augmenting  Qn  with  a  new 
tentative  observation  point  X/v+i-  We  proceed  by  forming  the 
2x2  covariance  matrix  ,  C,  of  this  point  set  and  define  the 
minimum  and  maximum  eigen- values  of  C  as  A-  and  A+ 
respectively.  The  condition  number  of  C,  the  ratio  is  an 
indication  of  the  circular  symmetry  of  Qnr  —  being  unity 
for  a  perfect  circle  and  zero  for  a  perfectly  co-linear  point 
set.  We  also  wish  to  take  into  account  the  size,  or  spatial 
extent  of  the  point  set.  Consider  the  case  where  Qn*  is  small, 
almost  round  cluster  (close  to  coincident)  we  do  not  wish 
to  place  a  high  utility  in  choosing  a  new  putative  observing 
point  that  would  invoke  no  or  little  change  in  view  point  even 
if  it  would  result  in  a  Qnr  with  polar  symmetry.  Instead  we 
would  prefer  to  undertake  a  significant  excursion  away  from 
the  original  cluster  despite  the  resulting  reduction  in  We 
encapsulate  this  preference  by  weighting  the  ratio  by  a 
sigmoid  function.  This  leads  us  to  an  expression  of  the  utility 
u(XN+ 1)  of  visiting  a  new  point  XN+1  =  (xN+1,yN+i) 
given  a  recent  history  of  N  measurement  locations  D  n  • 

u(XN+i)  =  Utanh(A  -  -)  (1) 

here  a  and  (3  parameterize  the  “break  point”  and  knee  width 
of  the  sigmoid  function  respectively.  The  experimental  results 
presented  in  this  paper  use  marine  surface  craft,  so  Q  has 
so  far  been  considered  to  be  set  of  2D  points.  However 
Equation  1  is  equally  applicable  to  the  3D  x,  y,  z  measurement 
positions  that  result  from  Autonomous  Underwater  Vehicle 
(AUV)  operations. 


Fig.  8.  The  contours  of  the  utility  surface  (in  xy-space)  of  obtaining  a  new 
“ping”  from  a  new  location  conditioned  on  two  illustrative  distributions  of 
previous  “ping”  locations.  The  linear  distributions  on  the  left  produces  two 
symmetric  peaks.  Moving  to  either  one  would  substantially  increase  the  spatial 
diversity  and  importantly  resolve  potential  left/right  ambiguity  resulting  from 
linear  motion  with  range  only  measurements.  The  circular  distribution  on  in 
the  right  hand  plot  yields  a  polar- symmetric  utility  surface  -  no  one  direction 
is  preferred  over  another  any  substantial  (but  not  excessive)  radial  motion  will 
satisfy  the  desire  to  range  from  a  maximally  spatial  diverse  set  of  locations 

IV.  The  Experiment  Setup 

For  experimental  validation  of  the  architecture  two  au¬ 
tonomous  kayaks  fitted  with  acoustic  modems  were  used. 

A.  Behavior  Configuration 

The  two  behaviors  described  in  the  previous  section  were 
configured  in  a  manner  that  retained  a  steady  priority  weight 
for  the  survey  behavior,  and  a  varying  priority  weight  for  the 
behavior  for  resolving  position  uncertainty.  When  the  position 
uncertainty  became  higher,  the  “resolve”  behavior  grew  in 
priority  until  the  vehicle  sufficiently  maneuvered  to  reduce 
uncertainty  via  an  acoustic  modem  communication. 

B.  Acoustic  Communication  and  Acoustic  Telemetry 


Fig.  9.  Basic  WHOI  Micro-modem  board  set:  power  amp  and  main  board. 
In  this  simple  low-power  configuration,  the  modem  communicates  using 
Frequency-Hopping  Frequency  Shift  Keying  (FH-FSK)  at  a  burst  rate  of 
80bps.  At  25KHz,  the  modem  can  communicate  at  ranges  greater  than  2km, 
in  the  challenging  Very  Shallow  Water  environment. 

The  autonomous  vehicles  are  equipped  with  WHOI  Micro¬ 
modems  which  can  provide  acoustic  telemetry  and  ranging. 
The  modem  is  a  small  low-power  design  based  around  the 


TABLE  I 

WHOI  Micro-modem  Specifications 


Spec. 

Description 

Units 

Prx 

Power  consumption  while  receiving 

80  mW 

Ptx 

Power  consumption  while  transmitting 

10  W 

SPLtx 

Transmitter  sound  pressure  level 

192  dB  re  /i  Pa 

A  Trx 

Receiver  timing  resolution 

125  n  sec 

TFSK 

FSK  data  rate 

80  bps 

Bfsk 

FSK  packet  data  payload 

32  bytes 

Tfsk 

FSK  packet  duration 

3.2  seconds 

Ttnodes 

Maximum  number  of  nodes  per  subnet 

16  nodes 

fc 

Band  C  center  frequency 

25  KHz 

BWC 

Band  C  Bandwidth 

4  KHz 

Texas  Instruments  TMS320C5416  fixed-point  DSP,  as  shown 
in  Fig.  9.  The  Micro-modem  was  designed  for  autonomous 
vehicles  and  long  term  sub-sea  deployed  systems  [27].  Its 
specifications  are  summarized  in  table  I. 

The  modem  provides  access  to  a  half-duplex  underwa¬ 
ter  acoustic  channel  using  a  time-domain  multiple  access 
(TDMA)  scheme  [28].  Telemetry  packet  transmissions  are 
scheduled  using  a  short  (1.1  second  duration)  cycle  ini¬ 
tialization  (Cl)  packet,  followed  shortly  thereafter  with  a 
longer  telemetry  packet.  Typically,  one  modem  implements 
a  schedule  by  transmitting  Cl  packets,  and  all  modems  in  the 
subnet  observe  the  schedule,  transmitting  their  telemetry  when 
commanded,  and  receiving  otherwise.  This  allows  modem 
users  to  share  their  telemetry  across  the  acoustic  subnet,  in 
synchronized  broadcast  fashion. 

Because  telemetry  packets  have  small  (32  byte)  data  pay- 
loads,  a  library  of  compressed  binary  messages  called  the 
Compact  Control  Language  (CCL)  has  been  developed  [29]. 
This  library  has  been  expanded  to  include  a  UTM  navigation 
message,  for  transmitting  high  precision  vehicle  location  in¬ 
formation  to  another  platform.  This  new  CCL  message  was 
used  to  convey  the  “shuttle”  kayak’s  position  estimate  to  the 
“survey”  kayak. 

In  addition,  the  Micromodem  is  capable  of  acoustic  naviga¬ 
tion.  Networks  of  acoustic  transponders  can  be  interrogated, 
estimating  range  via  travel-time  measurements,  and  modems 
can  estimate  range  to  each  other,  using  an  acoustic  network 
ping  packet.  In  this  experiment,  short  ping  packets  (also  1.1 
second  duration)  were  used  to  estimate  range  between  the 
autonomous  kayaks. 

C.  The  Marine  Vehicle  Platforms 

Experimental  validation  is  done  on  four  kayak-based  au¬ 
tonomous  surface  crafts  depicted  in  Fig.  10.  See  [30]  for  more 
details  on  this  platform. 

V.  Experimental  Results 

Two  Kayaks,  fitted  with  acoustic  modems  were  used  to 
generate  the  results  in  this  section.  One  vehicle  which  will 
ultimately  play  the  role  of  the  nav-aid  vehicle  remained  still 


Fig.  10.  A  kayak-based  autonomous  surface  craft. 
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at  the  origin  while  the  other  took  the  role  of  the  survey 
vehicle.  The  latter  executed  a  trajectory  determined  by  the 
interplay  of  two,  sometimes  contradictory  behaviors,  described 
in  Sections  III-B  and  III-C  —  the  desire  to  execute  a  simple 
survey  pattern  (in  this  case  a  “box”)  and  also  the  desire  to 
add  spatial  diversity  to  the  locations  from  which  vehicle  to 
vehicle  ranging  occurs.  The  latter  “resolve”  behavior  running 
on  the  survey  vehicle  is  modulated  by  a  quality  metric,  rj,  of  its 
own  onboard  navigation.  In  the  field  this  may  be  for  example 
horizontal  dilution  of  precision  (HDOP)  or  the  volume  of  its  1- 
sigma  position  uncertainty  ellipsoid.  When  the  quality  metric 
is  low,  greater  importance  is  attached  to  obtaining  a  nav-fix 
by  ranging  to  and  communicating  with  the  nav-aid  vehicle. 
However,  to  obtain  such  a  fix  while  avoiding  the  singular 
configurations  and  left-right  ambiguities  that  plague  range- 
only  estimation,  spatial  diversity  is  required  -  precisely  what 
the  “resolve”  behavior  was  designed  to  instigate.  In  Figure  1 1 
we  paint  the  trajectory  of  the  vehicle  with  color  to  denote  the 
onboard  navigation  quality  metric.  Blue  and  red  represent  high 
and  low  navigational  confidence  respectively  which  accord  low 
and  high  importance  to  the  resolve  behavior.  In  short,  when  the 
trajectory  is  shaded  red  the  motivation  to  execute  a  “wiggle” 
is  increased  over  that  when  it  is  blue.  The  experimental  arena 
(a  large  river)  was  not  sufficiently  large  to  allow  a  genuine 
degradation  of  navigation  to  accumulate  over  time.  Instead 
the  evolution  r]  was  simulated  by  a  zero  to  unity  ramp  with  a 
period  of  60  seconds.  Although  crude,  this  approach  does  aid 
clarity  while  illustrating  the  efficacy  of  the  IvP  framework  in 
terms  of  action  selection.  In  particular  the  importance  attached 
a  particular  behavior  is  not  coupled  to  the  effect  it  has  on  the 
system  at  large  which  would  be  the  case  if  the  resolve  behavior 
really  had  an  effect  on  r\. 


Around  the  perimeter  of  Figure  11a  selection  of  utility 
surfaces  are  rendered.  Each  depicts  the  utility  associated  with 
achieving  one  more  ping  from  a  x,y  position  conditioned 
upon  the  set  of  locations  from  which  the  survey  vehicle  has 
recently  pinged  the  stationary  nav-aid  vehicle.  The  recent  ping 
locations  are  shown  as  white  dots  superimposed  over  the  utility 
surfaces.  Note  how  the  value  of  r]  modulates  the  overall  utility. 
For  example  at  point  20  the  utility  of  a  course  change  to 
diversify  ping-locations  is  small  because  the  vehicle  possesses 
a  high  degree  of  confidence  in  its  position  estimates,  on  the 
other  hand  at  point  17  the  converse  is  true  and  we  see  a 
perturbation  occurring  in  the  ensuing  vehicle  path.  The  straight 
line  in  the  south  east  comer  is  a  result  of  an  extended  period 
of  failed  acoustics.  With  so  few  recent  ping-locations  pinging 
from  anywhere  results  in  significant  spatial  diversity  and  so 
the  vehicle  simply  executes  the  unadulterated  baseline  survey 
behavior  without  perturbation. 

VI.  Conclusions 

In  this  work  we  have  applied  a  novel  method  for  executing 
multi-objective  optimization  on  a  marine  vehicle  platform  and 
shown  its  suitability  in  resolving  conflicts  between  sensor 
requirements  and  primary  task  completion  in  the  marine 
environment.  Although  this  work  has  demonstrated  that  a 
sensor  optimization  behavior  and  survey  behavior  can  both 
be  satisfied,  much  practical  work  remains  to  implement  a 
robust  solution  for  underwater  vehicles  operating  in  the  ocean 
environment.  For  example,  underwater  survey  sensors  such 
as  sidescan  sonars  and  cameras  have  different  behavioral 
constraints  from  the  survey  behavior  presented  here.  Also, 
the  navigation  sensor  optimization  behavior  should  rely  on 
observed  variables  such  as  acoustic  link  quality,  and  navigation 
solution  quality  estimates. 
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Fig.  11.  A  rendering  of  the  experimental  results.  The  vehicle  (an  autonomous  Kayak  see  Figure  10)  executes  a  meandering  trajectory  under  the  influence  of 
two  behaviors  running  in  the  IvP  framework.  One  behavior  is  set  on  marshalling  the  vehicle  around  a  rectangular  survey  pattern  similar  to  that  in  Figure  7.  The 
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measurements  -  with  so  few  recent  ping  locations  any  motion  produces  spatial  diversity  and  the  preferred  linear  motion  of  the  survey  behavior  is  un-opposed. 
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